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Abstract 

Let ti,... ,tn S and consider the location recovery problem: given a subset of pairwise 
direction observations {(ti — tj)/\\ti — d|2}i<je[n]x[n]i where a constant fraction of these obser¬ 
vations are arbitrarily corrupted, find {tpyi up to a global translation and scale. We propose 
a novel algorithm for the location recovery problem, which consists of a simple convex program 
over dn real variables. We prove that this program recovers a set of n i.i.d. Gaussian locations 
exactly and with high probability if the observations are given by an Erdds-Renyi graph, d is 
large enough, and provided that at most a constant fraction of observations involving any par¬ 
ticular location are adversarially corrupted. We also prove that the program exactly recovers 
Gaussian locations for d = 3 if the fraction of corrupted observations at each location is, up to 
poly-logarithmic factors, at most a constant. Both of these recovery theorems are based on a 
set of deterministic conditions that we prove are sufficient for exact recovery. 


1 Introduction 


Let T be a collection of n distinct vectors ... ,tn'^ E and let G = {[n],E) be a graph, 

where [re] = {1,2..., re}, and E = Eg U E),, with Ej, and Eg corresponding to pairwise direction 
observations that are respectively corrupted and uncorrupted. That is, for each ij E E, we are 
given a vector Vij, where 


Vij = 


fio) _ ^( 0 ) 

—A-T— for ij E Eg, Vij E for ij E Eb. 


( 1 ) 


Thus, an uncorrupted observation Vij is exactly the direction of relative to , and a corrupted 
observation is an arbitrary unit vector. Consider the task of recovering the locations T up to 
a global translation and scale, from only the observations {vij}ij£E, aud without any knowledge 
about the decomposition E = Eg \J Eb, nor the nature of the pairwise direction corruptions. 

A special case of this problem, with d = 3, is a necessary subtask in the Structure from Motion 
(SfM) pipeline for 3D structure recovery from a collection of images taken from different vantage 
points, a vital aspect of modern computer vision. In the SfM problem, camera locations and 
orientations are represented as vectors and rotations in with respect to some global reference 
frame. Given a collection of images, and for any point in there is a unique perspective projection 

of it onto each imaging plane. By building local coordinate frames around salient points in the 
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given images, based entirely on photometric information, and comparing them across images, one 
obtains an estimate of a set of point correspondences. That is, one obtains a set of equivalence 
classes, where each class corresponds to a physical point in 3D space. Given sufficiently many 
such sets of point correspondences, epipolar geometry and physical constraints yield estimates 
of the relative directions and orientations between pairs of cameras. Noise in these estimates is 
inherent to any real-world application, and worse yet, due to intrinsic challenges arising from the 
image formation process and properties of man-made scenes (illumination changes, specularities, 
occlusions, shadows, duplicate structures etc), severe outliers in estimated point correspondences 
and hence relative camera poses are unavoidable. 

Once camera locations and orientations are estimated, 3D structure can then be recovered by a 
process called bundle adjustment [24], which is a simultaneous nonlinear refinement of 3D structure, 
camera locations, and camera orientations. Bundle-adjustment is a local method, which generally 
works well when started close to an optimum. Thus, it is critical to obtain accurate camera location 
and rotation estimates for initialization. SfM therefore consists of three steps: 1) estimating relative 
camera pose from point correspondences, 2) recovering camera locations and orientations in a global 
coordinate framework, and 3) bundle adjustment. While the first and third steps have well-founded 
theories and algorithms, methods for the second step are mostly heuristically motivated. 

Several efficient and stable algorithms exist for estimating global camera orientations mum 
[I8l[7l[22l[llllll|8llll[ini[171|20]. Hence, it is standard to recover locations separately based on 
estimates of the orientations. 

There have been many different approaches to location recovery from relative directions, such 
as least squares piaiaiiTi, second-order cone programs and /oo methods [la [HI US IHEI], 
spectral methods |3|, similarity transformations for pair alignment [22|, Lie-algebraic averaging 
uni, markov random fields |5|, and several others [221 l25l [20l I12j . Unfortunately, most location 
recovery algorithms either lack robustness to correspondence errors (which are unavoidable in large 
unordered datasets), at times produce illegitimate collapsed solutions, or suffer from convergence to 
local minima, in sum causing large errors in or a complete degradation of, the recovered locations. 

There are some recent notable exceptions to the above limitations. An algorithm called IdSfM 
|28j focuses on removing outliers by examining inconsistencies along one-dimensional projections, 
before attempting to recover camera locations. However, one drawback of this method is that it 
does not reason about self-consistent outliers, which occur due to repetitive structures, commonly 
found in man-made scenes. Also, Ozye§il and Singer propose a convex program over dn + \E\ 
variables for location recovery and empirically demonstrate its robustness to outliers |19j . While 
both of these methods exhibit favorable empirical performance, they lack theoretical guarantees of 
robustness to outliers. 

In this paper, we propose a novel convex program for location recovery from pairwise direction 
observations, and prove that this method recovers locations exactly, in the face of adversarial 
corruptions, and under rather broad technical assumptions. To the best of our knowledge, this 
is the first theoretical result guaranteeing location recovery in the challenging case of corrupted 
pairwise direction observations. We also demonstrate that this method performs well empirically, 
recovering locations exactly under severe corruptions of relative directions, and is stable to the 
simultaneous presence of noise on all the observations, as well as a fraction of arbitrary corruptions. 

1.1 Problem formulation 

The location recovery problem is to recover a set of points in from observations of pairwise 
directions between those points. Since relative direction observations are invariant under a global 
translation and scaling, one can at best hope to recover the locations ..., up 
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to such a transformation. That is, successful recovery from {vij}(^ij)^E is finding a set of vectors 
{Q:(t® + rc)}jg[„] for some re E and a > 0. We will say that two sets of n vectors T = {ti,..., 
and are equal up to global translation and scale if there exists a vector w and a scalar a > 0 
such that ti = + w) for all i E [n]. In this case, we will say that T and have the same 

‘shape,’ and we will denote this property as T ~ The location recovery problem is then stated 
as: 


Given: G{[n],E), {vij}ij(zE satisfying ([I]) 

Find: T = {ti,...,t„} such that T ~ (2) 

For this problem to be information theoretically well-posed under arbitrary corruptions, the 
maximum number of corrupted observations affecting any particular location must be at most 

Otherwise, suppose that for some location , half of its associated observations Vij are consistent 
with and the other half are corrupted so as to be consistent with some arbitrary alternative 
location w. Distinguishing between and w is then impossible in general. Formally, let deg^(i) 
be the degree of location i in the graph {[n],Eh). Then well-posedness under adversarial corruption 
requires that max* degft(z) < qn for some 7 < 1 / 2 . 

Beyond the above necessary degree condition on Eg for well-posedness of recovery, we do not 
assume anything else about the nature of corruptions. That is, we work with adversarially chosen 
corrupted edges E^ and arbitrary corruptions of observations associated to those edges. To solve 
the location recovery problem in this challenging setting, we introduce a simple convex program 
called ShapeFit: 

n 

min y^^\\Py±{ti-tj )\\2 subject to - tj,Vij) = I, 'S^ti = 0 (3) 

‘ ^ J ijeE ij&E i=i 

where is the projector onto the orthogonal complement of the span of Vij. 

This convex program is a second order cone problem with dn variables and two constraints. 
Hence, the search space has dimension dn — 2, which is minimal due to the dn degrees of freedom 
in the locations {ti} and the two inherent degeneracies of translation and scale. 

1.2 Main results 

In this paper, we consider the model where pairwise direction observations about n i.i.d. Gaussian 
locations are given according to an Erdos-Renyi random graph. We start by showing that in a 
high-dimensional setting, ShapeFit exactly recovers the locations with high probability, provided 
that there are fewer than an exponential number of locations, and provided that at most a fixed 
fraction of observations are adversarially corrupted. 

Theorem 1. Let G{[n],E) be drawn from G{n,p) for some p = Q.{n~^P). Take ~ 

AA(0, Idxd) to he i.i.d., independent from G. There exists an absolute constant c > 0 and 07 = Ll{p^) 
not depending on n, such that i/max(^, ^log^re) < n < e'^'^ and d = H(l), then there exists an 
event with probability at least 1 — on which the following holds: 

For arbitrary subgraphs E), satisfying maxj degf,(f) < 771 and arbitrary pairwise direction corruptions 
Vij E forij E Eh, the convex program ([3]) has a unique minimizer equal to } ■ [ ] 

for some positive a and for t^^'* = ^ X]ie[n] 
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This probabilistic recovery theorem is based on a set of deterministic conditions that we prove 
are sufficient to guarantee exact recovery. These conditions are satisfied with high probability in 
the model described above. See Section IQ for the deterministic conditions. 

This recovery theorem is high-dimensional in the sense that the probability estimate and the 
exponential upper bound on n are only meaningful for d = n(l). Concentration of measure in 
high dimensions and the upper bound on n ensure control over the angles and distances between 
random points. As a result, lower dimensional spaces are a more challenging regime for recovery. 

Our other main result is in the physically relevant setting of three-dimensional Euclidean space, 
where for instance the locations correspond to camera locations. In this setting, we prove that exact 
recovery holds for any sufficiently large number of locations, provided that a poly-logarithmically 
small fraction of observations are adversarially corrupted. 

Theorem 2. There exists no E N and c G M such that the following holds for all n > uq. Let 
G{[n],E) be drawn from G{n,p) for some p = log^^^ n). Take tf‘\...tn^ G where 

tf^^ ~ Af { 0 , 13 x 3 ) o,re i.i.d., independent from G. There exists 7 = Ll{p^/log^n) and an event of 
probability at least 1 — ^ on whieh the following holds: 

For arbitrary subgraphs Eb satisfying maxj degf,(i) < jn and arbitrary pairwise direction corruptions 
Vij G for ij G Eb, the convex program ([3]) has a unique minimizer equal to } ■ [ ] 

for some positive a and for t^^^ = ^ X]je[n] 

Numerical simulations empirically verify the main message of these recovery theorems: ShapeFit 
recovers a set of locations exactly from corrupted direction observations, provided that up to a 
constant fraction of the observations at each location are corrupted. We present numerical studies 
in the setting of locations in with an underlying random Erdos-Renyi graph of observations. 
Further numerical simulations show that recovery is stable to the additional presence of noise on the 
uncorrupted measurements. That is, locations are recovered approximately under such conditions, 
with a favorable dependence of the estimation error on the measurement noise. 


1.3 Intuition. 

ShapeFit is a convex program that seeks a set of points whose pairwise directions agree with as many 
of the corresponding observations as possible. The objective, ~ incentivizes 

the correct shape, while permitting translation and a possibly-negative global scale. Each term 
IIT* x(fi — tj)\\2 is a length-scaled notion for how rotated ti — tj is relative to ±Vij. The objective 
is in this sense a measure of how much total rotation is needed to deform all {ti — tj}ij£E into the 
observed directions of {Evij}. Successful recovery would mean that — tjOlblijeE is sparse. 

'^3 

Motivated by the sparsity promoting properties of £i-minimization, the objective in ShapeFit is 
precisely the norm over the edges E{G) of these I 2 lengths. 

The first constraint in ShapeFit, Ylij&Ei^i ~ ~ i-; requires that the recovered locations 

correlate with the provided observations by a strictly positive amount. It prevents the trivial 
solution and resolves the global scale ambiguity. As opposed to the objective, this constraint 
forbids negative scalings of The second constraint, ~ resolves the global 

translation ambiguity. 
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1.4 Organization of the paper 

Section 11.51 presents the notation used throughout the rest of the paper. Section [2] presents the 
proof of Theorem [TJ Section [3] presents the proof of Theorem [2l Section d] presents results from 
numerical simulations. 

1.5 Notation 

Let [n] = {1,... ,n}. Let e* be the zth standard basis element. Let Kn be the complete graph on 
n vertices. Let E{Kn) be the set of edges in Let || • ||2 be the standard £2 norm on a vector. 
For any nonzero vector v, let v = u/||u|| 2 . For a subspace W, let Pw be the orthogonal projector 
onto W. For a vector v, let Py± be the orthogonal projector onto the orthogonal complement of 
the span of {u}. 

Let T denote the set T = for € M'^. Define tij = ti — tj for all distinct i,j G [n]. 

We define /Too = max^^j \\t\f\\ 2 , and we define J2ij&E{G) Define i = ^ X^ieN 

Define and similarly. For a scalar c, let cT = For a given G = G{[n],E) 

and {vij}ij^E, where Vij G have unit norm, let R{T) = J2ij^E ~ Let L(T) = 

~ ij^Vij). Let £ij = {ti — tj,Vij), and similarly for . In this notation, ShapeFit is 
nuni?(r) subject to L(T) = 1, t = 0 

For vectors vi,... ,Vk, let S{vi,... ,Vk) = span(ui ,... ,Vk) be the vector space spanned by these 
vectors. Given tij and t\^\ dehne 5ij, r]ij, and Sij such that 

£ij — (IT Vij^ij 

where Sij is a unit vector orthogonal to t\^'^ and rjij = ||T’,(o)xtij H 2 - Note that r]ij > 0. 

2 Proof of high dimensional recovery 

The proof of Theorem [1] can be separated into two parts: a recovery guarantee under a set of 
deterministic conditions, and a proof that the random model meets these conditions with high 
probability. These sufficient deterministic conditions, roughly speaking, are (1) that the graph 
is connected and the nodes have tightly controlled degrees; (2) that the angles between pairs of 
locations is uniformly bounded away from 0 and vr; (3) that all pairwise distances are within a 
constant factor of each other; (4) that there are not too many corruptions affecting any single 
location; and (5) that the locations are ‘well-distributed’ relative to each other in a sense we will 
make precise. Theorem [3] in Section 12.11 states these deterministic conditions formally. 

We will prove the deterministic recovery theorem directly, using several geometric properties 
concerning how deformations of a set of points induce rotations. Note that an infinitesimal rigid 
rotation of two points {ti,tj} about their midpoint to {ti + hi,tj + hj} is such that hi — hj is 
orthogonal to Uj = ti — tj. We will abuse terminology and say that \\P^i_ {hi — /ij)|| is a measure 
of the rotation in a finite deformation {hi,hj}, and we say that {hi — hj,ti — tj) is the amount of 
stretching in that deformation. Using this terminology, the geometric properties we establish are: 

• If a deformation stretches two adjacent sides of a triangle at different rates, then that induces 
a rotation in some edge of the triangle (Lemma [2]) . 
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• If a deformation stretches two nonadjacent sides of a tetrahedron at different rates, then that 
induces a rotation in some edge of the tetrahedron (Lemma [3|) . 

• If a deformation rotates one edge shared by many triangles, then it induces a rotation over 
many of those triangles, provided the opposite points of those triangles are ‘well-distributed’ 
(Lemma [3|). 

• A deformation that rotates bad edges, must also rotate good edges (Lemma [5]). 

• For any deformation, some fraction of the sum of all rotations must affect the good edges 
(Lemma E]). 

By using these geometric properties, we show that all nonzero feasible deformations induce a large 
amount of total rotation. Since some fraction of the total rotation must be on the good edges, the 
objective must increase. 

In Section 12.11 we present the deterministic recovery theorem. In Section 12.21 we present and 
prove Lemmas [IHSl In Section 12.31 we present and prove Lemmas HHS In Section 12.41 we prove the 
deterministic recovery theorem. In Section 12.51 we prove that Gaussians satisfy several properties, 
including well-distributedness, with high probability. In Section 12.61 we prove that Erdos-Renyi 
graphs are connected and have controlled degrees and codegrees with high probability. Finally, in 
Section 12.71 we prove Theorem [TJ 

2.1 Deterministic recovery theorem in high dimensions 

To state the deterministic recovery theorem, we need two definitions. 

Definition 1. We say that a graph G{[n],E) is p-typical if it satisfies the following properties: 

1. G is connected, 

2 . each vertex has degree between ^np and 2np, and 

3. each pair of vertices has codegree between ^np^ and 2np^, where the codegree of a pair of 
vertices i,j is defined as \{k G [n] : ik,jk G E(G')}|. 

Note that if G is p-typical, then its number of edges is between ju'^p and n'^p. 

Definition 2. Let T = C be a set of n vectors. Let G be a graph with vertex set [n]. 

(i) For a pair of vectors x,y and a positive real number c, we say that T is c-well-distributed 
with respect to {x,y) if the following holds for all h G 

ll-PspanT-x,t-y}i(^)ll2 > c\T\ ■ ||(/i) || 2 • 

teT 

(a) We say that T is c-well-distributed along G if for all distinct i,j G [n], the set Sij = {tk ■■ 
ik,jk G E{G)} is c-well-distributed with respect to {ti,tj). 

We now state sufficient deterministic recovery conditions on the graph G, the subgraph Ef, 
corresponding to corrupted observations, and the locations 

Theorem 3. Suppose Ef,,G satisfy the eonditions 

1. The underlying graph G is p-typieal, 

2. For all distinct i,j, k G [n], we have 1 — > fi, 
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3. For all i,j,k,£ with j and k^l, we have co||t ®||2 < 

Each vertex has at most en edges in incident to it, 

5. The set is ci-well-distributed along G, 

6 . All vectors are distinct, 

for constants 0 < p,/3,co,e, ci <1. If e < 3 . 256 ^ 64 . 32 > then L{T^^'l) / 0 and T(‘^)/L(T®) is the 
unique optimizer of ShapeFit. 

Note that Condition 3 implies that for poo = maxj^j ||tj-j^|| 2 ) we have cq/Too < < hoo for 

all distinct i,j G [n]. Also note that Conditions 1-6 are invariant under translation and non-zero 
scalings of . 

Before we prove the theorem, we establish that 7 ^ 0 when £ is small enough. This 

property guarantees that some scaling of is feasible and occurs, roughly speaking, when < 

\Ea\. 

Lemma 1. If e < tif, then L{Tl^i) / 0. 

Proof. Since Vij = t\^'^ for all ij G Eg, we have 

L(T)= 5 : (4»>.„„)> Ii4“>ib- Eii4“’ib- 

ij£E(G) ijeEg ijeEf, 

By Condition 3, coiaoo\Eg\ < Y.ij^Eg and Poo\Eh\ > Thus it suffices to prove 

that co\Eg\ > \Ei,\. As e < |, Condition 1 and 4 gives \Eg\ > ^n^p — en^ > \n^p. Since \Eif\ < en^, 
if e < then we have coliligl > \Eij\. □ 

The proof of Theorem [3] appears in Section [2.41 


2.2 Unbalanced parallel motions induce rotation 

Lemma 2. Let d > 2. Let ti,t 2 ,t 3 G be distinct. Let vi,V 2 ,V 3 G and a G M. Let {6ij} be 
such that {vi — Vj — atij,tij) = (iij||tij ||2 for each distinct i,j G [3]. Then 


\\PF.iVi-Vj)\\2> 

*J6[3] 

Kj 


1 — (tl2, ^ 23 )^ 11^12112 


5i2 — 6 


13 


Proof. Note that Lj = —tji and 5ij = 6ji for each distinct i,j G [3]. Define W = span(ti 2 , t 23 T 3 i) 
and define Wi = PwVi for each i. Note that 

- Vj)\\2 > WPt^iWi - Wj)\\2. 
i<j i<j 


The given condition implies P^±{wi — wj) = Wi — Wj — (a -|- 6ij)tij for each distinct i,j G [3]. 

'^3 

Therefore, 


Y,\\Ptf{'^i- Wj)\\2 = Ui - Wj - (a -F 6ij) Lj 


i<j 


i<j 


> 


E 


Wi - Wj - ( a + 5ii ) t 


(i,i)=(l,2),(2,3),(3,l) 

= 11 ^ 12^12 + 523^23 + ^ 31 ^ 31112 - 


ij I i-ij 
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Since + t 23 + ^ 31 ) = 0, the right-hand-side above equals ||(5i2 - ^ 13)^12 -|- (^23 - <^13)^23112- 

Furthermore, 


(hi2 — 5l3)tl2 + (^23 — ^13)^23 > min 11(^12 — ^13)^12 — ■5^23112 

2 sGK 

- ^ 13)^12 


> 


<^12 — <5i3 


C 12 


2V ^ “ ^^12,^ 23 )^- n 


The previous lemma is applicable only when two disproportionally scaled edges are incident to 
each other. The following lemma shows how to apply the lemma above to the case when we have 
two vertex-disjoint edges that are disproportionally scaled. 

Lemma 3. Let d>2. Let ti,t 2 ,ts,t 4 ^ G be distinct. Let vi,V 2 ,vs,V 4 G R'^ and a G R. Let {Sij} 

be such that {vi — Vj — atij , iij) = 5ij\\tij II 2 for each distinct i, j G [4]. Define fi = min — {iij,iik)‘^ 
where the minimum is taken over all distinct i,j,k G [4] except for the cases when {j, A:} = {1,2}. 
Then 

||T’jx(ui - Uj)||2 > ^||tl2||2 ^12- ^34 

*de[4] 
i<j 


Proof. Note that tij = —tji and 5ij = 5ji for each distinct i,j G [4]. Since the given conditions 
are symmetric under re-labelling of (1 and 2), and of (3 and 4), we may re-label if necessary and 
assume that ||ti 3||2 > max{||ti 4 || 2 , ||t 23 || 2 , ||A 24 || 2 }- By the triangle inequality, we have 2||ti3||2 > 
||Ai 3||2 + IIA 23 II 2 > ||Ai 2 || 2 - Apply Lemma[2]to the triangle {1,2,3} to obtain 


E 

*<i, *J6{1,2,3} 


~ '^j)\\2 ^ V ^ “ (^ 12 ,^ 23 )^ 11^12112 

'^3 


^12 — <^13 


> /5||A 


12||2 


^12 — ^13 


(4) 


and similarly apply the lemma to the triangle {3,1,4} to obtain 


i< 3 , *je{l,3,4} 


^13 — ^34 


> /3||Ai3||2|5i 3 — h34| > ^||tl2||2|^13 “ ^34|- 


(5) 


By adding ([1]) and ([5]), we see that 


E 

*<i> *je{i,2,3} 


\Pp.iVi-Vj)\\2+ ^ \\Pt^.{Vi-Vj)\\2 

ij '' Z—/ tj 

i<j, i,ie{l,3,4} 


> /3||il2||2 


^12 — ^13 


+ ^I|Ai 2||2 


^13 — ^34 


> f I|il 2 |l 2 


^12 — ^34 


The lemma follows since the left-hand-side is bounded from above by 2 jg[ 4 ] \\P^± {vi — Vj)\\ 2 . □ 

’i<j 






























2.3 Triangles inequality and rotation propagation 

Lemma 4 (Triangles Inequality). Let d > 3; x, y,ti,t 2 , - ■ ■ , tfc € IfT = {ti, • • • , tk} is c-well- 
distributed with respect to {x, y), then for all vectors hx,hy,hi, - ■ ■ ,hk G and sets X C [k], we 
have 

\\P{^x-uYhx - hi )\\2 + \\Pit^-y)x{hi - hy )\\2 > {ck - |X|) • ||P(^_y)x(/ix - hy)\\ 2 . 

i&[k]\X 

Proof. For each z G [k], define IFj = span(x — t^, ti — y). Define P as the projection map to the 
space of vectors orthogonal to x — y, and define Pi for each i G [k] as the projection map to 
Since (x — U)-^ D and {U — y)-*“ D W^, it follows that 

\\P[x-ti)-'-i^o: ~ hi)\\2 + \\P(^ti-y)-'-i^i ~ ^J/)I|2 

ie[fe]\x 

- Y - hi)||2 + ||-Pi(/li - /lj^)||2 > Y WPii^x - hy)\\2. 

ie[fc]\x ie[fc]\x 

Since fi, • • • , tfc are well-distributed with respect to (x, y), we have 

Y \\Piihx - hy)\\2 > ck ■ \\P{hx - hy)\\2. (6) 

ie[k] 

Since \\Pi{hx — hy)\\2 < \\Pihx — hy)\\2 holds for all i, it follows that 

^ \\P,{hx - hy)\\2 > {ck - |X|) • \\P{hx - hy)\\2, 

ielk]\x 

proving the lemma. □ 

The proof of Theorem [3] will rely on the following two lemmas, which state that rotational 
motions on some parts of the graph bound rotational motions on other parts. The following lemma 
relates the rotational motions on bad edges to the rotational motions on good edges. Recall the 
notation tij = {l + 6ij)t\^^ +'nijSij where Sij is a unit vector orthogonal to t\^^ and rjij = \\P,(o)±tij\\ 2 . 

Lemma 5. FixT. Ifeo<^, then EijeE.Vij > ^EijeE.Vij- 

__ _ __ _ 2 

Proof. For each edge ij G E{Kn), by Conditions [U [H [5l Lemma [Hand eo < we have 

Y + 'hjk) > f Cl • - 2eonj ■ ryj > ^np'^ ■ ryj. 

k^i,j 

ikJkGEg 

Therefore, if we sum the inequality above for all bad edges ij G Eb, then 

Y ■ 12 Pr 

ijeEi, k^i,j ij&Ei, 

ikJksEg 

For fixed ik G Eg, the left-hand-side may sum as many times as the number of bad edges 
incident to the edge ik. Hence by Condition 4, the left-hand-side of above is at most 

E E {rjik + Vjk) < 2eon- Y Pj- 

ijeEh k^i,j ij&Eg 

ikJkGEg 
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Therefore by combining the two inequalities above, we obtain 

ijeEi ^ ij&Eg 

The following lemma relates the rotational motions over the good graph Eg to rotational motions 
over the complete graph Kn- 

Lemma 6. FixT. Ifeo<^, then Y.ijaEg'nij > ^ %'• 

Proof. For each ij G E{Kn), since is ci-well-distributed along G and G is p-typical, we 

have as in the proof of Lemma O 

ivik + Vjk) > ( Cl • Irup^ - 2eon\ ■ rjij > ■ rnj. 

k^i,j ^ ^ 

ikJkGEg 

If we sum the above over all ij G E{Kn), we obtain 

i^ik + Vjk) > jnp'^ ■ 

ij&E{K„) kjti,j ij&E{Kn) 

ikJkGEg 

For a fixed ik & Eg, the left-hand-side may sum pik as many as times as the number of edges of G 
incident to ik. Therefore since G is p-typical, we see that 

(ilik + Vjk) < 2 • 2np Pij. 

ky^ij ij&Eg 

ikJkGEg 

By combining the two inequalities, we obtain 

^np^ Vij < 4rap 

ijeE(Kn) ij&Eg 

and thus ^ T.ij&E(K^) Vij < EijeEg Vij- □ 


2.4 Proof of Theorem [3] 

We now prove the deterministic recovery theorem. 


Proof of Theorem 0 By Lemma [T] and the fact that Conditions 1-6 are invariant under global 
translation and nonzero scaling, we can take t^^^ = 0 and = 1 without loss of generality. 

The variable poo is to be understood accordingly. 

We will directly prove that R{T) > for all T ^ such that L{T) = 1 and t = 0. 

Consider an arbitrary feasible T and recall the notation tij = (1 -|- + Vij^ij where Sij is a 

unit vector orthogonal to and pij = \\P,(o)±tij\\ 2 . A useful lower bound for the objective R{T) 
is given by 


= '^\\P^±tij\\2 = 


Y1 12 WKfMjh 

ij&Eg ij&Et, 


Y1 

ij&Eg ij&Et, 



R{T^^^)+ Y1 X] (I'5*illl4? lb TOO¬ 

L'S Eg ij&Ef, 


( 7 ) 
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Suppose that lb < %'• Since Lemma [5] for e < ^ implies Yij&Ef, Vij < 

i YijeEg by dZD, we have 

R{T) > /2(r(0))+ ^ r?,,- ^(|<5.,-|||tg^||2 + r?.,-) 

L'S-Bg ij&E}, 

> + ^ ^ 2riij > i?(r®). 

L'eEg L'S-Efc 

Hence we may assume 

I'iiillbSf lb > Y ( 8 ) 

ij€Eb ij&Eb 

In the case {Ej,] / 0, define S = j^^Yij^Eb bu'l average ‘relative parallel motion’ on 

the bad edges. For distinct edges ij,k£ G E{Kn), if {lj} H {k,f\ = 0, then define r]{ij,ki) = 
Vij + Vik + Vii. + Vjk + Vji. + Vm, and if {i, j} n {k,f\ ^ 0 (without loss of generality, assume I = i), 
then define r]{ij, ki) = r]ij + ijik + r]jk- 

Case 0. 5 = 0 or I Fife I = 0. 

Note that 5 = 0 implies 5ij — 0 for all ij € Fife, which by ([ 8 ]) implies rjij = 0 for all ij G Fife. 
Therefore by ([7]), we have 

R{T) > i?(rW) + ^ r,,j. 

ij&Eg 

If YijeEg Vij > 0) then we have R{T) > R{T^^'^). Thus we may assume that rjij = 0 for all ij G Eg. 

In this case, we will show that T = 

_ 2 

By Lemma m if eo < then rjij = 0 for all ij G E{G) implies that rjij = 0 for all ij G E{Kn). 

For ij G Fife, since 5ij = rjij = 0, it follows that lij = Since (^ijlbj-^^lb = ^ij ~ ^ij^ ^ 

we have 

0 = E (G - 4”’) = E (G - 4’) + E (G -4’) = E (g - 4’) = E ^«ii4ii^. 

ij&E{G) ij&Eb ij&Eg ij&Eg ij&Eg 

where the first equality is because F(T) = F(T(°b = 1. By Condition 6, |bj-°^|b / 0 for all i 7 ^ j. 

Therefore, if 5ij / 0 for some ij G Eg, then there exists ab,cd G Eg such that 5ab > 0 and 6 cd < 0. 

By Lemma [2] or [3] and Condition 2, this forces r/(a 6 , cd) > 0, contradicting the fact that rjij = 0 for 
all ij G E{Kn). Therefore 5ij = 0 for all ij G Eg, and hence 5ij = 0 for all ij G E{G). 

Define U = + hi for each i G [n]. Because rjij = 5ij = 0 for all ij G E{G), we have hi = hj 

for all ij G E{G). Since G is connected, this implies hi = hj for all i G [n]. Then by the constraint 

Z^j 6 [n] ti = Z]ie[n] get Fj = 0 for all i G [n]. Therefore T = 

Case 1. 5 ^0 and YijeEg I^L'I < Pl-^al |Fife| ^ 0. 

Define Ffe = {ij G Fife : > ^d}. Note that YijeEb\Lb 1%! ^ and therefore 

E I'^bl = E l^bl - E l-^bl > E I'^bl - l'^\Eb\ = Imi (9) 

ijeLb ij&Eb ij&Eb\Lb ij&Eb 
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Define Fg = {ij E Eg : \5ij\ < ^<5}. Then by the condition of Case 1, 

-6\Eg\ > \5ij\ > \5ij\ > -S\Eg \ Fg\, 

ij&Eg ijeEg\Fg 

and therefore |£'g \ Tg| < ^\Eg\, or equivalently, \Fg\ > ^\Eg\. 

For each ij E Lb and ki E Eg, by Lemmas [21 [3l and Condition 3, we have r]{ij,ki) > j\Sij — 
^ke\ ■ Wtijh > 4 • ■ Wtijh > Therefore by Condition 1 , 

E E-jw-m) s E E^i%i=Ei^»i'^N 

ij&Eb ki(^Eg ij^Lfjkl^Fg ij&Li 

ij&Lb 

where the last inequality follows from Q. For each ij E E(Kn), we would like to count how many 
times each ijij appear on the left hand side. If ij E Eb, then there are at most ( 2 ) K^s and n K^s 
containing ij] hence r]ij may appear at most 6 ( 2 ) + 3n = 3n^ times. If ij ^ Eb, then rjij appears 
when there is a K 4 or a containing ij and some bad edge. By Condition 4, there are at most 
2en such bad K^s. If the bad edge in K 4 is incident to ij, then there are at most 2sn • (n — 3) such 
K 4 S, and if the bad edge is not incident to ij, then there are at most \Eb\ < en^ such K 4 . Thus rjij 
may appear at most 3 • 2en + 6 • {2sn{n — 3) + en^) < 18en^ times. Therefore 


E E r]{ij,k£) < ^ 3 n^ • ^ ISen^-ruj. 

ij&Eb kieEg 


By Lemma m if e < we have 

via, ki) < 


ijeEt, kieEg 


Hence 


42e 

cip 2 


rn 


ij&Eb 

ijeE{Kn) 

Yi "ib- + yz 

18£n^ • pij < 

ij&Eg ij£E{Kn) 


y^ Vij> 32 1 ^ 3 !-<51^61. 


42£ 

Cip 2 


tU 


Y1 


ij&E(Kn) 


ij&E(K„) 

If e < |, then \Eg\ > ^ — \Eb\ > Further, if £ < ^ 2 ^ 2 ^ 32-8 ’ then by Condition 3, 5 / 0, and 
\Eb\ / 0, the above implies 

jjCQClP^ T| p I ^ Pcocip^ 1 T| p I 

/ ^ Vij — ^2 . — /fo 00 o ' _ ' 


ijGE(Kn) 


Lemma 0 implies 


32 T| I \ I r 

cip cip 

ij&Eb 


42 • 32 • 8 £ 
.( 0 ), 


d 0 )| 


yy — 26 yz ^ 11 ^/ 112 - 

ij&Eg ij&E(Kn) ij&Eb 

( 0 ), 


Therefore by (l8D,we have EijeEg h + Vij) if e < min{^, By 

this shows R{T) > R{T^^^). This condition on £ is satisfied under the assumption e < 
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for 


Case 2. 5 / 0 and \^ij\ > k^\^g\ \Eb\ ^ 0. 

Define -E+ = {ij € Eg : 6ij > 0} and E- = {ij G Eg : 5ij < 0}. Since tij — = ( 5 ij|| 4°^||2 

ij G Eg, we have 

0 = E = E(««-4“’)+E««ii4“’ii^' 

ij&E{G) ij&Ei, ijeEg 


where the first equality follows from L{T) = L{T^^')). Therefore, 


E 

h0)|| 

ij 2 

< 

E««-4”’) 

ij&Eg 



ij&Eb 


< Y1 (.\^ij\\\4fh+r]ij) < 2nooS\Eb\, 

ij&Et, 


where the last inequality follows from ([ 8 ]), Condition 3, and the definition of 8. On the other hand, 
the condition of Case 2 and Condition 3 implies YlijeEg ^ |co/Uoo(5|-Eg|- Therefore 


E (-'5«)ii4“’ib 

ij&E. 


■E 

ijeEg 


<5--Ilf® I 


+ X] 1 ^' 

ijeEg 


do) I 
^J\\\hj 


|2 I > 


1 4i 


2 V8 


y^0f^OoS\Eg\ 2>llQQ6\E}y\ 


If £ < ^cop, then since < en^ and \Eg\ > — \Efy\ > |n^p, we see that ^cofiooS\Eg\ — 

2^ioo8\Eb\ > ^coHoo8\Eg\. Therefore lb > :^coHoo8\Eg\. Similarly, Eije£;+lb > 

I ■ 

If |£’+| > then by Lemmas [2l [3l and Condition 3, we have 


ij&E- M£E+ 


> E E ^(-s,nCh 

ij&E- M&E+ 

> E (-<^*i)lli°^ll2 • f l^+l > j\E+\ • Lco/roo5|i))5 

ij&E. 


Similarly, if |i4-| > \\Eg\, then we can switch the order of summation and consider Yhij^E^ YlkieE^ kt) 
to obtain the same conclusion. 

Since each edge is contained in at most copies of ^4 and n copies of K^, (and there are 

6 edges in a K^, 3 edges in a K^), we have 


E E +3n^ E Vij<^n‘^ E 

ijeE- kieE+ ^ ' ij&E{K„) ij&E{K„) 


If e < f, then |£'g| > jn‘^p—\Ei,\ > Further, if e < 3 . 256 ^ 64.32 1 tlien since <5/0 and \Eb\ < en^, 

we have 

E % 

ij&E{Kn) 


1 /3co/ioo<5|p |2 ^ ^ 2 ^ 32 - 

3n2' 256 ’ - 3-256 


By Lemma [ 6 l if e < 


ClP" 


then this implies 


El — j^g El ^ 2 /roo<I|II' 6 | • 

ijeEg ij£E(K„) 
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Therefore from (HI), ([SI), and Condition 3, if e < min{^, 3 . 256 ^ 64 . 32 }; then 

R{T) > i?(r(0))+ ^ r?,,- j;(|5*,|||t®||2 + ^?*,) 

ijSEg ij^Ef} 

> R{T^°y) + 2f,^S\Eh\ - Y, 2\5ij\\\t\f\\2 > i?(r(o)). 

ij&Et, 

This condition on e is satisfied under the assumption e < 3 . 255 ^ 64.32 ■ 


□ 


2.5 Properties of Gaussians in high dimensions 

In this section, we prove that i.i.d. Gaussians satisfy properties needed to establish Conditions 2, 

3, and 5 in Theorem [3l We begin by recording some useful facts regarding concentration of random 
Gaussian vectors: 

Lemma 7. Letx,y be i.i.d. J\f {0, Idxd), cind e < 1, then 

P ((i(l - e) < \\x\\l < d{l + e)) > 1 - 

and 

^{\{x,y)\>de)<e-^"^ 

where c> 0 is an absolute constant. 

Proof. Both statements follow from Corollary 5.17 in 123 , concerning concentration of sub-exponential 
random variables. □ 

Lemma 8 f [27j Corollary 5.35). Let A be an n x d matrix with i.i.d. M{0, 1) entries. Then for 
any f > 0, 

p (^crmax(^) > Vn + \fd +-^ < 2e~^ 
where (Tniax(^) is the largest singular value of A. 

Lemma 9. Let tf'\i G [n] be i.i.d. M{0,ldxd)- Then, there exists an event E, such that on E, we 
have for all i,j, k,l € [n\,i ^ j,k ^ I, 


and for all distinct i,j,k G [n], 
and ¥{E^) < 3n^e“'^'^, where c> 0 is an absolute constant. 

Proof. This follows from repeated application of Lemma [7] with e = 1/100 and a union bound. □ 

We can now show that gaussian vectors have the well-distributed property with high probability. 
Recall that S{x,y) = span(x,y). 

Lemma 10. Let ti,...tn G be i.i.d. Af{0,ldxd), and let n > 16 and d>3. For a fixed k^l, 
the inequality 

holds for all /i G with probability of failure at most 5ne“'^‘^, where c > 0 is an absolute constant. 


\hj II2 

|h0)|| 
rki II2 


>1 
“ 10 


< 1/3 
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Proof. Throughout the proof, constants named c may be different from line to line, but are always 
bounded below by a positive absolute constant. For a fixed {I, k), let x = ti,y = tk- We would like 
to show 

n ^ 

^ ^ ll-^5(x—ti)-L (^) II 2 ^ (^) II 2 

i=l 

We note that S{x — ti,y — U) = S{x — y,x + y — 2ti). Thus, 

^S{x—y,x+y—2ti)-^{^) ^S{x—y,x+y—2ti)-^(.^S{x—y)-^(.^)') 

Thus, it’s enough to show 

n ^ 

\\Psix-y,x+y-2Upih)\\2 > -n\\h\\2 

i=l 

for h -L {x — y). 

Now, for any vectors v, w, we have 


S{v, w) = S{v, Wy±) 

where w^± = w — {w, v)v. If h T u, we have 

= h — {h, v)v — {h, w^i_)w^i_ 


W^x\\2 W^„x ||2 


W.^±\\2 ||t(^„x||2 


Where z = w — 




II«'„X||2 I|U’„X||2 

|2 _ ||„„ „'',\X,I |2 


= h — {h, w)w + {h,w) I re — 

= Psiwpih) + {h,w)z 

. Now, assuming that |(i),u))| <1/2 and using that 


= Ik - {'w,v)v\\i = WMli - 2|kll2(w'k) + lkll2K^k)l > Ikll2(i - 2Kwk)l) 


we have 


1^^112 = 


W 2 I , 

w — T. - — yw — \w, v)vj 


\w. 


w 


1 - 


v-^ 112 

kill 


W. 


< 


1 - 






112 


+ 


kill 


w. 


{w, v)v 


112 


W. 


d-L 112 




W, 


rlkk)l = e((w'k)) 


D-L II 2 


w, 


(1 + I(u;,i))|) - 1 


< 


t>-L 112 

3|k,kl A 


= C{{w,v)) 


Thus, we have 


1 - 2 |(u;,{))| 

\Ps{v,wpih)\\2 > ||P5(^)x(/l)||2 - C((wk))kl|2 
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Therefore, by taking v = x — y and w = x + y — 2ti, to conclude the desired statement of the present 
Lemma, it suffices to show that 


\\Ps{x+y- 2 U)^ih )\\2 > 'yn\\h \\2 

i=l 


where 7 > 1/5 + C ^ that x — y and x + y — 2ti are independent, and 

- y) ='^ \{x + y - 2ti) ='^ A^(0, Idxd)- Applying Lemma [7] to x — y and x + y — 2ti with a small 


enough value of e to ensure C 


{x-y,x + y + 2ti) \ 
V \\\x - yhWx + y - 2 ti\\ 2 ) 
Thus, it suffices to show with high probability, that 





Y \\Ps(x+y-2U)x{h)\\2 > 0 .3n||h||2, 
i=l 


which we proceed to establish below. 

To begin, redefine v,w as v = x + y and w = —2ti and consider 




i=l 


> 


Ps{v+Wi)X (h) 


2 = 1 


EL 


2=1 


7 - - — ^{h,v + Wi){v + Wi) 

\V + Wi\\2 


> n 


> n 


^ U + WiW^ 
1=1 


{v + Wi){v + Wi)*h 


E 


1 


> IKdL 


where in the last inequality we used 


n — 


^ It + 

1 

miuj llu + iCi II2 


op 


^(v + Wi)(v + Wi) 
n 

YX'^ + + Wi) 


i=l 


opj 


n ^ 

Y. Ti—^-1T2 - 


1 


miuj ||u + tCjlln 


Yix + Wi){v + Wi)* 
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Now, let A = X]r=i ^ We have 


y^Xv + Wi){v + WiY 


1=1 


V i=l 


yy^{vv* + VW* + WiV* + WiW*) 
n \ * / r 

<n||OT*||oD+ + ( y~! ^ 

\i=l 
n 

E* 

i=l 
n 

^ ^ Wj + O'niax(^) 


< n\\v\\l + 2||u||2 
= n\\v\\l + 2||u||2 


+ 


op 




i=l 


op 


+ 


WiW* 

i=l 


i=l 


Thus, 


Y \\^siv+wi)^{h) 2 > \\h\\2 

i=l 


n — 


n\\v\\l + 2||i;||2 \\Yh= 1 Wi\\2 + Crmax{A) 


2 n 


miiij ||u + Wi \\2 


Now, consider the event 


E=< 

min u + Wig > 6d/3i, 

\\v\\l < 2dl32, 

n 

Y^i 

' ] 

< indPs, fTmax(^)^ < nl34^ 




2=1 

2 J 


On E, we have 

n 

E 


2=1 


> IKhl2 


n — 


k 


6d/3i 


+ 


= IK'-II2 


1 /32 4:\/2dy/n\f^2Xz 

'fi — _77_ — _ — _77 

3 /3i 6dl3i 6d/3i 


= K'' 2 


n i — 


^ _ Xkl _ _^4\/2V^2^\ 

3 /?! 6d/3i y/n 6/3i j 


Now, note that |||u + Wi \\2 ='^ ik IIZ]r=i^*ll 2 X^id) and ^4 is a random n x d 

matrix with i.i.d. M{0, 1 ) entries. 

Thus by applying Lemma [7] we have 


P {6d{l - e) < \\v + Wig < 6 d(l + e)) > 1 - 
P (2d(l - e) < \\v\\l < 2d{l + e)) > 1 - 


P 4nd{l — e) < 


E 

2=1 


Wi 


< 4n(i(l + e) 1 > 1 — e 


— ce'^d 
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where c > 0 is a universal constant. Also by taking t = \/M in Lemma [ 8 ] we get 

p (crmax(A) > Vu + 2yf(tj < 

Now, let /?! = 1 - /32 = /33 = 1 + 1 ^, Pa = f, which gives 

i^< 1/3 + 1/99, ^ = 1/10 

o Pi yn bpi yn 5a 

We have 

p (o-max(A) > < P (^(Tmax(A) > y/n + 2yf(tj < 2e~'^ 

whenever ^/n + 2^/d < 'Jn^dj2, which holds whenever 

2Vd y 

which holds for n > 16 when d> 3. Since for n > 16, < 1/4, we have on E 

n 

Y1 ^ 0.3n\\h\\2 


2=1 


Thus, 


< 0 . 3 n||/i ||2 <P(ii;^)<(n + 3)e 


—cd 


^ 2=1 


where c > 0 is an absolute constant. 
Combining all of the above, we get 


2=1 


2 > -n\\Psi^^_yy 


with probability of failure at most 5ne 


—cd 


□ 


Lemma 11 . LetG{[n],E) hep-typical, and ti,... ~ A^(0, I^xd) be i.i.d. Then T = {L}ig[n] is 

^-well distributed along G with probability at least 1 —10n^e“'^‘^, where c > 0 is an absolute constant. 

Proof. For each ij G E, let S'y = {k £ [n]]ik,jk G E{G)} and note that |5jj| < 2np‘^. Now 
apply Lemma do] to the set of vectors {L, tj} > with the distinguished vectors being 

{ti,tj}, which gives the desired property for the pair (i,j) with probability of failure at most 
5{\Sij\)e~'^'^ < 5(2np^)e“'^'^, where c > 0 is an absolute constant. Taking the union bound over 
pairs of distinct integers i,j G [n], we get the desired property simultaneously for all pairs with 
probability at least 1 — ■ b{2np^)e~‘^ = 1 — 10 n^p^e“'^'^ > 1 — 10 n^e“'^‘^. □ 
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2.6 Random graphs are p-typical with high probability 

Lemma 12. There exists an absolute constant c > 0 such that for all positive real numbers p < 1, 
G{n,p) is p-typical with probability at least 1 — if np > 41ogn. 

Proof. A graph is not connected only if there exists a partition lA U V 2 of its vertex set for which 
there are no edges between Vi and V 2 . Without loss of generality, we may assume that |Vi| < [§J • 
Since the number of ways to choose a set of size k from a set of size n is , the probability that 
G{n,p) is not connected is at most 


Ln/2J , [n/2j 

^ pVl^ 

k=l k=l ^ 


< 


Ln/2J 

E 

k=l 


ne 


l—p{n—k) 


Since k < we have < 1 (since np > 41ogn). Therefore the summand 

on the right-hand-side is at most which is maximized at A: = 1. This shows that the 

probability that G{n,p) is not connected is at most 

In G{n,p), for a fixed vertex v, the expected value of deg(u)is (n — l)p, and for a pair of vertices 
V, w, the expected value of the codegree of v and re is (n — 2)p^. Therefore the lemma follows from 
Chernoff’s inequality — see Fact 4 from [T] — and a union bound. □ 


2.7 Proof of Theorem [T] 

We can now prove the high dimensional recovery theorem, which we state here again for convenience: 

Theorem 1. Let G{[n],E) be drawn from G{n,p) for some p = Take tf‘\...tn'^ ~ 

AA(0, Idxd) to be i.i.d., independent from G. There exists an absolute constant c > 0 and 07 = Ll{p^) 
not depending on n, such that if ^log^re) <n< e'^'^ and d = n(l), then there exists an 

event with probability at least 1 — on which the following holds: 


For arbitrary subgraphs Ef, satisfying max* deg^(i) < yn and arbitrary pairwise direction corruptions 
Vij G forij G Fife, the convex program ([3|) has a unique minimizer equal to | 

for some positive a and for ^ X]je[n] 


n 


Proof. It is enough to verify that G, T and Eh in the assumption of the present theorem satisfy the 
deterministic conditions 1-6 in Theorem 2, with appropriate constants p, /3, cq, e, ci, and with the 
purported probability. By Lemma fl^ Lemma El and Lemma fTTl we have that Condition 1 holds 

with value p. Condition 2 holds with jd = Condition 3 holds with cq = ^, and Condition 5 
holds with Cl = |, with probability at least 


1 - - 3n‘^e-^‘^ - lOn^e-'^'^ 


where c > 0 is an absolute constant. ^ ^ 

Thus, taking any Eh , which satisfies Condition 4 with 7 = ^ < 25^^32- 64 - 3 ’ that 

recovery via ShapeFit is guaranteed. Note that the condition maxdegfe(f) < yn is nontrivial 
when p = 12 ( 71 “^/“^). Using the requirements on n and p, we have 

and 13n^e“'^'^ < 13(e6‘^'^)^e“'^‘^ < 13e“2^‘^. Thus, the probability of exact recovery via ShapeFit, 
uniformly in Eh and Vij satisfying the assumptions of the theorem, is at least 

1 _ - 136 -^'^'^. □ 
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3 Proof of three-dimensional recovery 

The proof of recovery in three dimensions parallels the proof in high dimensions, but it is more 
technical because it can not capitalize on the concentration of measure phenomenon in high di¬ 
mensions. Specifically, the additional technicality in three dimensions comes from the fact that for 
large n, there exist pairs of locations tf‘\ that are close to each other, i.e., ||t-j ^||2 is small. For 

such pairs of vectors, with high probability, for all k ^ i,j the value of 1 — will be small. 

This fact introduces the following two main obstacles in carrying out the same analysis: 

1. There is no uniform lower bound on 1 — ■ Hence Condition 2 in Theorem [3] fails. 

2. There is no uniform lower bound on Hence Condition 3 in Theorem [3] fails. 

These are indeed obstacles since the gains in rotational motions coming from Lemmas [2] and [3] 
are proportional to ~ i^ik We avoid these difficulties and prove the three- 

dimensional analogue of Theorem [3] by weakening Conditions 2 and 3. Roughly speaking, in 
Condition 2 holds for most triples i,j,k G [n] (instead of all triples) and Condition 3 gets replaced 
by a one-sided version where we only have a uniform upper bound on the lengths ||t^^^|| 2 - 

Unlike in the high-dimensional case where we allowed a constant fraction of edges incident to 
each vertex to be corrupted, the three-dimensional case requires the fraction of corrupted edges 
incident to each vertex to be at most ^ ). This additional poly-logarithmic factor is due to 

the fact that our well-distributedness proof in three dimensions hinges on the maximum £2 norm 
of locations, which is H(yTog^) with high probability. It can be removed for a distribution of 
locations that has a uniform constant upper bound on ||t|*^^|| 2 . 

3.1 Deterministic recovery theorem in three dimensions 

We now state deterministic conditions on the graph G, the corrupted observations Eb, and the 
locations that guarantee recovery. Recall the definition /r = |£;(G)| ^ijeE{G) ll^/lb- 

Theorem 4. Suppose T^^\Eb,G satisfy the conditions 

1. The underlying graph G is p-typical, 

2. For all distinct i,j G [n], for all but at most ein indices k G [n] satisfying k 7 ^ i,j, we have 
1 - {iij,iik)‘^ > /32 and 1 - (iijjjk)'^ > (3“^, 

3. For all distinct i,j G [n], we have ||tj °^||2 < co/r, 

4 . Each vertex has at most Squ edges in E^ incident to it, 

5. The set is ci-well-distributed along G, 

6. No three vectors t\^\t^^\t^k'^ are collinear for distinct i,j,k. 

for constants 0 < p,/?,£ 0 , ei, ci < 1 < cq. If Eq < 32 . 3 . 62 i 024 c§ L(T^°i) / 0 and 

T^^'l/L{T^^'l) is the unique optimizer of ShapeFit. 

Note that all six conditions are invariant under translation and non-zero scalings of T^^i (Condi¬ 
tion [3] is invariant since both and pL scale together and are invariant under translation). Before 
we prove the theorem, we establish that L(T^^^) 7 ^ 0 when eq is small. This non-equality guarantees 
that some scaling of T^^'i is feasible whenever, roughly speaking, \Eb\ < \Eg\. 

Lemma 13. If Eq < then L{T^^^) 7 ^ 0. 
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Proof. Since Vij = for all ij G Eg, we have 

i(r'»')= E Eii4”’ii^-Eii4°’ii^= E ii'S’ib-^E ii4“’ib. 

ij&E{G) ij&Eg ij&Ef, ij&E{G) ij&Et, 

By Conditional Ylij^E^ ^ co/r|-E';,| < co^-eon? < Since Condition [H implies |-E(G)| > 

we have Yliij&E{G) ^ Therefore it follows that L{T^^^) >0. □ 

3.2 Proof of Theorem [4] 

Lemmas [2] and [3] will be repeatedly used throughout the proof. Note that these lemmas can be used 
only if the given set of vectors satisfies a certain condition on the angles between them. For each 

distinct ij € E{Kn), define B{ij) as the set of edges k£ G E{Kn) such that y^l — (tic\ 
holds for some distinct a,b,c G {i,j,k,i} satisfying (a, 6 ) ^ {i,j)- Note that Lemmas [2] and [3] can 
be applied to the set of indices {i,j, k,f\ (having size either 3 or 4) for all k£ ^ The following 

lemma shows that B{ij) is small for each ij. 

Lemma 14. For each ij G E{Kn), we have \B{ij)\ < 6ein^. 

Proof. For each ab G E{Kn), define B‘i{ab) as the set of indices c G [n] distinct from a,b for 

which ■\/l — < /? or \J^ 4?)^ < holds. Condition [2] implies \Bs{ab)\ < ein for 

all ab G E{Kn). One can check that ki G B{ij) if and only if one of the following events hold: 
k G B‘i{ij), t G B‘i{ij), k G Bs{ii) U B^{jl), i G Bj,{ik) U B'^{jk). Therefore 

\B{ij)\ < 2\BSj)\ -n+Y, {iBsm + l^3(jT)|) + {jB^m + |B3(ik)|) 

k^i,j 

< 2ei'nf + n • 2 ein + n ■ 2ein = 6 ein^. □ 


We now prove the deterministic recovery theorem in three dimensions. 


Proof of Theorem By Lemma [l3] and the fact that Conditions 1-6 are invariant under global 
translation and nonzero scaling, we can take = 0 and = 1 without loss of generality. 

The variable /r is to be understood accordingly. 

We will directly prove that R{T) > for all T ^ such that L{T) = 1 and t = 0. 

Consider an arbitrary feasible T and recall the notation tij = (1 + 5ij)tf^ + PijSij where Sij is a 
unit vector orthogonal to and rjij = \\P,(o)±tij\\ 2 . A useful lower bound for the objective R{T) 
is given by 


R{T)= E liny 


ijeE{G) 


till 2 


X] 11 


ijeEg 


ijeEb 


ijeEg 


ij&Et, 


-L ^ij 112 


> Y + Y iWVifh - \^ij\\\t\fh - Vij 


f( 0 )| 


/ 2 (T(o)) + ^ rjij - Y 


bllldVll^ 


+ Vij)- 


( 10 ) 


ijeEg 


ij&Et 
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Suppose that Yhij&Ef, \^ij\ 




h Y^ijeE ''lij- Therefore by (fTOll . we have 


< YijeEhVij- Since eo < Lemma [5] implies YijeE^Vij < 


M\ 


R(T) > R{To) + Vij - E 

ij^Eg ij&Eh 

> RiTo) + ^ rjij - ^ 2r]ij > R{To) 

ij&Eg ijeEb 


+ Vij) 


Hence we may assume 


t^\ 


2 > Y1 ^ir 

ij&Eb 


( 11 ) 


E i%i 

ijeEb 

In other words, the total parallel motion is larger than the total rotational motions on the bad 
edges. The key idea of the proof is to show that parallel motions on bad edges induce a large 
amount of rotational motions on good edges. 


In the case \Ei)\ ^ 0, define 5 := ^ 

llhj IL 


as the average ‘relative parallel motion’ 

on the bad edges. For distinct ij,k£ G E(Kn), if {*,j} H {k,£} = 0, then define r]{ij,k£) = 
Vij + Vik + Vie + Vjk + Vji + Vke^ nnd if {i,j} n {k,i} / 0 (without loss of generality, assume I = i), 
then define r}{ij, ki) = r]ij + rjtk + Vjk- 

Case 0. 5 = 0 or \Eb\ = 0. 

Note that 5 = 0 implies 6ij = 0 for all ij G E^,, which by dSD implies r]ij = 0 for all ij G E^. 
Therefore by ([Tj), we have 

R{T) > R{T^^'>) + ^ rj,j. 

ijeEg 

If YijeEg Vij > 0) then we have R{T) > R(T^^^). Thus we may assume that T]ij = 0 for all ij G Eg. 

In this case, we will show that T = 

_ 2 

By Lemma O if eo < then ijij = 0 for all ij G E{G) implies that rjij = 0 for all ij G E{Kn). 
For ij G Fife, since 5ij = rnj = 0, it follows that Since 5ij = lij — for ij £ Eg, we have 


0 = ^ (iij 

ijeE(G) 


C) 


E ««- 4°’) + E (G - 4°’) = E ««- 4°’) = E ««ii‘ 


40) 


( 0 )^ 


40)1 


ij&Eb 


ijeEg 


ij&Eg 


ijeEg 


where the first equality is because L{T) = L(r)^)) = I. By Condition 0 , we have || / 0 for all 
ij G Eg. Hence if Sij / 0 for some ij G Eg, then there exists ab,cd G Eg such that 5ab > 0 and 
5cd < 0. By Lemma [2] or [3] and Condition 6 , this forces r]{ab,cd) > 0, contradicting the fact that 
rjij = 0 for all ij G E{Kn). Therefore 5ij = 0 for all ij G Eg, and hence 6ij = 0 for all ij G E{G). 

Define ti = + hi for each i G [n]. Because rnj = 5ij = 0 for all ij G E{G), we have hi = hj 

for all ij G E{G). Since G is connected (by Condition [1]) , this implies hi = hj for all i £ [n]. Then 
by the constraint U = = 0, we get hi = 0 for all i £ [n]. Therefore T = This 

proves Case 0. 


We may now assume that 5/0. Since £ij — II 2 for ij £ Eg, we have 


0 = {iij 

ijeE{G) 


-O = E(^^.-C)+ 

ij&Eb ijeEg 
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Therefore 



< 

E(««-4°’) 

ij&Eg 


ijeEb 


ij&Ei, 


< 2 5 ; I%II|4“'||2. 

ij&Ef, 


( 12 ) 


where the final inequality follows from ([8]). 

Define E'g = {ij G Eg : ||t -°^||2 > as the set of ‘long’ good edges. Since Y^ij(zEg\E'g < 

^/j.\Eg\, we have 


Eii4“’ib = E ii4’ib-Ei4“’ib- E ii44b 

ijGE'g ij&E{G) ij&Ef, ijeEg\E'g 

> f^\E{G)\ - co/i • \Ef,\ - ^f^\Eg\ > n ■ ^ri^P- 

where the last inequality uses \E{G)\ > {E;,] < eo'n?, \Eg\ < |i4(G)|, eo < By Condition[3l 

we have || 4 j ^||2 < co/i for all ij, and thus it follows that 


\K\> 


16co 


n^p. 


(13) 


Case 1. 5 / 0 and Ylij&E' l%l < 7 ^ 0. 

In this case, we will exploit the fact that there is a difference between average relative parallel 

motions on long good edges and that on bad edges, to show that there is a large amount of rotational 
motion on the K^s of the form {i,j, k,i} where ij G Eb and ki G Eg. Define Lb = {ij G Eb : \5ij\ > 

F)- **■»<: EijeE,\L, I'>«II|4°’|I2 < FSjeE, I|4°42 = 1 EijeE, l'S«lll4°T' Therefore 

E I^«iii4°'iI2 = E I^«III4°'iI2- E I««III4°'iI2>5 E I^«IIi4”’II2- (“) 

ijeLi, ijeEi, ij&Ef,\Li, ij&Ef, 

Define Eg = {ij G Eg : \5ij\ < |(5}. Then by the condition of Case 1, 

ii|£;i>EiFia E iFi> 

^3&E'g ijeE'g\Fg 

and therefore \Eg\Fg\ < ^\E'g\,oT equivalently, \Eg\ > ^\Eg\ > ^^n^P (where the second inequality 
comes from (fT3]) l. 

For each ij G Lb and kl G Eg\B{ij), by Lemmas[2]and[3l we have p{ij, k£) > j\Ske—Sij\\\t[^'^ II 2 > 
Therefore, 

ij&EfjkiaEg ij&Lh kl&Fg\B{ij) ij&Lt, 

By Lemma [Til we know that |il(ij)| < 6 ein^ holds for all ij G E{Kn). For si < we have 
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Therefore 


Y 1 

ij&Eb M&Eg 


> - ■ 

8 64co 




E 

^j&L^ 


l<5< 


ij I 


t^\ 


2 > 


13 


1024co 


n'^p 


E !«•- 

ij&Ef, 




where the second inequality comes from (I14p . 

For each ij € E{Kn), we would like to count how many times each r]ij appear on the left hand 
side. If ij G E'b, then there are at most ( 2 ) K^s and n K^s containing ij\ hence may appear at 
most 6 ( 2 ) + 3n = 3n^ times. If ij ^ then r]ij appears when there is a K/^ or a containing 
ij and some bad edge. By Condition 4, there are at most 2eon such bad iCas. If the bad edge 
in Ki is incident to ij, then there are at most 2eon • (n — 3) such K^^s, and if the bad edge is 
not incident to ij, then there are at most |£'ft| < eon? such K^s. Thus rjij may appear at most 
3 • 2eon + 6 • {2£Qn{n — 3) + Eqv?) < 18eon^ times. Therefore 


E E r]{ij, ki) < ^ 3n^ • r]ij + ^ ISe^n^ 


Vij- 


ijeEbkleEg ijeEb 

2 

If ^0 E then by Lemma [U we thus have 


ij£E{Kr,) 


ijeEt,MeEg ij^Eg ij£E{Kn) ijeE{Kn) 


Hence 


Y ^ Y Y ki) > 


cip" 


Y 


ij£E{Kn) ij&Ef, ki£Eg 


( 0 )|| 

1024co ”"' ^ Il2- 

ij&Et 


If £0 < 16.4^1^2400 ^ Ilien 




> 


fjcip^ 


El««lll4”’ll'^>Il:El« 


42 • 1024coeo ^ ^ 

ij&E{K„) ^ ^ ijGEb 


ClP ^ 

iJ&Eb 


( 0 )|| 
l|2- 


If £0 < , then by Lemma [U this gives 

E-I«£T E W > 2 E I*: '"'™' 


g '-j ' - 

ij&Eg ijeE(Kn) ij&Et, 


Since Lemma [5] implies Ylij^Eg'^lij — ‘^YlijeEi,Pij (given eq < together with the inequality 

above, we have Y.ij&Eg'^ij > + %)• By ([TO]), this shows that R{T) > R{Tq). 

The parameters must satisfy Sq < min{2J|-, ^q.^ 2 E 02 Aco ’ - Tgfe^- 

Case 2. 5 / 0 and Ylij&E'g l%l - \^b\ ^ 0. 

In this case, we first show that there are large amount of positive and negative parallel motions 
on the good edges. This will imply that there is a large amount of rotational motions on the Ki^s 
of the form {i,j,k,i} where ij,ki G Eg and 5ij > 0, < 0. Since ||t ®||2 > lo^' ^ 

Case 2 implies 

Y \^ij\\\4fh> Y \^ij\\\tfh>^k- Y \^ij\ > 

ijeEg ij&E'g ijeE'g 
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Define -E+ = {ij G Eg : 6ij > 0} and E- = {ij G Eg : 6ij < 0}. The inequality above and (fT^ 
implies 




1 


( 0 ), 


ijeE+ 


> 


ij&Eg 

1 / 1 


2 I ^ 

ij&Eb 


( 0 ), 


|2 I > 


1 / 1 


2 V 16 


fi6\E'\ - 2cofiS\Eb\ . 


From m, we have \Egi\ > Therefore if Sq < 77 ^ 5 ^, then 


1024c^ 


(0)|L ^ 1 ..T / 1 ^2 


E ‘'«ii4 112 2 m'*'' 

ijeE+ 

f( 0 )| 


32^ Vl6co 


n p — 32coeora > 


1 


1024co 


p5n^p. 


Similarly T.ij&E.^-^ij)Wij h > To 24 ^^'^™ P- 

We either have |i?+| > \\Eg\ or |F1_| > ^l-Egl- If the former holds, then by Lemmas [2] and O 


^ ^ Y 7(-'^b)ll4° 


f( 0 )| 


ij£E- k££E+ 


ij£E— ki£E^\B{ij) 

4 f ■ E (-%)ii4’ib(i£+i-iB(«)i)- 

ijeE- 


By Lemma [TH we have \B{ij)\ < 6 ein^, and thus \E^\ — |-B(ij)| > ^\Eg\ — Qeiv? > — 

£ 0 ^^) “ 6 ein^. If eo < -^p and ei < j^p, then ITI+I — \B{ij)\ > -^n^p, and the above gives 


E E ’'(y.M) a I ■ E (-<5«)ii4’ib > I ■ 10 ^ 

ij&E- M&E+ ij&E- 


pdn^p^. 


Similarly, if \E_\ > \\Egl then EiieE+ EfcteE- v{ijM) > 

On the other hand since each edge is contained in at most copies of iL 4 and n copies of 

it's (and there are 6 edges in a 1 ^ 4 ), we have 

Y ^ ri{ij,ki) < +3n^ Y Vij<^n^ Y 

ijeE^ kl£E+ ^ ^ ij&E{Kr,) ij&E{Kn) 


If £0 < 


_ /^giP _^ then 

32-3-64-1024-cS’ ^ ^ 


V- 1 

ijeE(Kn) 


f3 


p5n^p^ = 


3n2 64•1024co 




3 • 64 • 1024co 
( 0 ) 




> —cop6\Ei,\ > — ^ \Sij\\\tf^j^ h 

crp cip 


where the last inequality follows from Condition [3l If eo < j then by Lemma [U this implies 




y~! — 26 X!/ ^ ^ I'^iilllLj II2) 

ij&Eg ijeE{Kn) ij^Ef, 
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Therefore from (fTOll and (fTTI) . 

R{T) > RiTo)+ ^ mj- E(I'^*^'II|4°^II2 + %) 

ij&Eg ijeE), 

> RiTo) + 2 ^ - Y1 m,\\\tf^^h = RiTo). 

ijeE), ij&Eh 

The parameters must satisfy Eq < min{^^^, ^p, and ei < ^p. □ 


3.3 Properties of Gaussians in three dimensions 

The first lemma establishes a bound on the average distance between random Gaussian vectors. 

Lemma 15. There exists a positive constant c such that if G is a p-typical graph with vertex set 
[n], then with probability at least 1 — 

\\ti - tj\\2 > ^n'^p. 

ijeEiG) 

Proof. Let u E be a fixed vector. Note that for all j E [n], we have 

lb “ = IblP + PilP “ 

Therefore if {v,tj) < 0, then ||u — tj \\2 > ||tj|| 2 - Further, by the symmetry of Gaussian random 
variables, we know that the distribution of ||tj||2 remains the same even after conditioning on the 
event {v, tf) < 0. Therefore 


IE[||u — tj|| 2 ] > P (^(u, tj) < 0^ • E ||tj ||2 {v,tj) <0 


= ^mjh] 



where the final equality holds since each ||fj||2 is subgaussian with mean -^/S/vr. Fix an index i E [n] 
and let iVj be the neighborhood of i in G. Since G is p-typical, we have |Nj| > ^np. By the analysis 
above, we see that 


E 


Yl 11 ^* b'lb 

j&Ni 





By Proposition 5.10 in Vershynin m on the concentration of subgaussians, there is a constant c 
such that with probability at least 1 — we have YljeNi b* “ ^ ^ Therefore 

by taking the union bound over all indices i E [n], we see that with probability at least 1— 


E ll II i \^ \^ II II 7T- 1 f 2 

\\ti - tj\\2 >2YY If* “ b'lb >^-^np=-n p. 

ijeE{G) ie[n]jeNi 


The second lemma establishes a bound on the angle between random Gaussian vectors. 


Lemma 16. Let x,y be linearly independent vectors. Ifti,t2,-" E are independent 

random Gaussian vecotrs, then with probability 1 — for all but at most jdn vectors ti, we 

have 


/ tj-x y-x ^ 

\||ii-a;|| 2 ’ ||y-x|| 2 / “ 2{\\ti\\l + \\x\\l)' 


26 













Proof. Fix an index i G [n]. Note that 


1 - 


ti - X 


y-x 


- XII2’ \\y - x\\2 


P, 


ti - X 


(y-xp 


> 


\\ti - x\\2 

P{x,y}-^ti\\2 


^{y—x)^ {ti 


\ti - x\\ 


P> 


\ti-x\\l 2{\\ti\\l + \\x\\l) 


\x,y}^ti\\2 


(15) 


Since ti is a random Ganssian vector and x, y are linearly independent, the distribution of ||-P{a;,y}itj ||2 
is that of the absolute value of a standard normal distribution. Therefore ^{\\Pix,y}-'-^i\\2 < 

Let Ij be the indicator random variable of the event that 

\\^{x,y}^'f'i\\'2 < /5- We seen above that E[lj] < Further, since are independent, 

it follows that {li}i 6 [n] are independent. Therefore by Chernoff’s inequality, 

•* fE - vf/*" > ^'9") s 

\ie[n] / 

Hence with probability 1 — there are at most y^/3n + ^/3n < (3n vectors ti for which 

\\P[x,y}-'-'^i\\‘2 < lemma now follows from (flSll . □ 

The next lemma shows that random Gaussian vectors are well-distributed with respect to a 
fixed pair of vectors. 

Lemma 17. There exists a positive real number c such that the following holds for all pairs of 
linearly independent vectors x, y G M^. Ifti, • • • , tn £ are independent random Gaussian vectors, 
then with probability 1 —the set of vectors {ti, • • • ,tn} are with 

respect to {x,y). 

Proof. Let c be a positive real number to be chosen later. We may rotate the vectors so that 
X = {£, 0, X 3 ) and y = {£, 0, ys) for some X 3 , 2 / 3 , £ G M where £ > 0. Note that ||x -|- y \\2 > 2£. Define 
£q = max{l,f'}. It suffices to give an estimate on the probability that 




span{ti—x,ti—y}-^ 


i=l 


cn 

2 ^ - 

4 


holds for all vectors h G {x — y)-^ = {(a, 6 , 0 ) : a,b ^ M} satisfying \\h \\2 = 1 . 

Fix a vector h = (/ii,/i 2 , 0 ) satisfying ||L ||2 = 1. For ti = ti, 2 ,L, 3 )) we have span{ti — x,ti — 

y} = span{{0,0,l),x+ y-2ti} = span{{0,0,l), {2£ - 2tip,-2ti^2,0)}- Hence s = (L,24 -L,i,0) G 
span{ti — x,ti — y}'^, and 


IP. 


span{ti—x,ti—y}-^ 


2 — 


144)1 




iti,2,£- 4i,o) • h 




4iL,2 + (^ - tip)h2\ 

\]tl2 + i^-tip? 
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Assume that hi > h 2 >0, which implies hi > Since is normally distributed with variance 
1, the probability that —1 < tjp < 0 is p for some fixed postive real number p. Conditioned on this 
event and the event that ti ^2 > 0 (note that tip and Up are independent), we have 


span{ti—x,ti—y}-^ 


\hiU,2 + (^ - ti,i)h2\ ^ hiUp 


Therefore 


IP. 


span{ti—x,ti—y}-^ 





Note that for tip > 0, the inequality 


feiq ,2 

\/*i,2+4^0 


j2 

> ^ is equivalent to ^ which is 


equivalent to “ 4 ^) P Since hf > ^ and £0 > we have 


P IIP 


span{ti—x,ti—y}-^ 


2 > > P(t ?,2 > 4 I ti ,2 > 0 ) • ip = g 


for some fixed positive real number q. By considering the indicator random variable of the events 
l|Pspan{ti-x,q-?;}-L (^)||2 > we See by Chernoff’s inequality that with probability 1 — 
there are at least ^ indices i G [n] such that \\Pspan{ti-x,ti-y}-'-W\\‘2 > Note that this implies 


n 

^ ^ ll-^spanlti—x,ii—(^) II 2 

i=l 


qn 1 qn 
T ' ^ “ 44' 


To handle the case of h 2 > hi > 0, note that if tip < 0 and 0 < tip < 1, then 


span{ti—x,ti—y}^ 


\hitip + {i- tip)h2\ ^ {i - tip)h2 ^ J_ ^ - Up 
^1^1 ,2 + ti,lV \/4 + (^ “ ^ \/l + (^ “ 


Since is decreasing in the range x > 0, if Up < —1, then \\Pspan{ti-x,ti-y }-'-{^)\\2 P p- 

Therefore we see as in above that with probability 1 — , there are at least ^ indices i G [n] 

such that \\Pspan{ti-x,ti-y}-'-i^)\\2 > ^ P the remaining cases can be handled analogously. 

Let P be a set of |'27r • vectors uniformly distributed along the circle S 2 = {(x, y, 0) : 

x^ + = 1}. Apply the analysis above to each vector in H and take the union bound to conclude 

that with probability 1 — for all h ^ H, 


n ^ 

l|Pspan{q-x,ti-i/}^ (h) II2 P 


Let h' G S 2 be an arbitrary vector and let /i G P be the vector closest to h'. The distance from h 
to h' along the circle S 2 is at most 27r • and hence ||/i. — /i '||2 < Thus for all i, we have 


■^span{q—x,q—j;}-'-(h ) II2 P 

> 


Pspan{ti—x,ti—y}^^^^\\2 11 ^- ^ ||2 

(h) II2 Sid 
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Therefore 


Z^\\^span{ti-x,ti-y}^[^ )\\2 _ ^ ^ p^- 


Z=1 


Since Iq = max{l,£} < max{l, ||x + y|| 2 }) this implies the lemma for c = 


_ Q 


□ 


By applying the nnion bonnd together with the three lemmas above, we obtain the following 
lemma. 


Lemma 18. There exists c, ^ G M and no G N such that the following holds for all positive real 
numbers e and natural numbers n > uq. Let G be a p-typical graph with vertex set [n] for some 
p satisfying np^ > Clogn. Ifti,--- ,tn G are independent random Gaussian vectors, then the 
following holds with probability 1 — n~^, 

1. For each distincti,j G [n], for all but at mosten indices k G [n], we have ’ ||^-//| | ^ )^ ^ 

64 log n ’ 

2. for all distinct i,j G [n], we have ||ti—fj ||2 < 40-v/logn-/r, where p, = YlijeE{G) Pi—tjlb, 

and 

3. the set is well-distributed along G. 

Proof. Let c be eight times the constant coming from Lemma [T71 For each distinct i,j G [n], define 
Sij = {tk ■ ik,jk G E{G)}. Consider the following events: 

(i) for all i G [n], we have ||ti ||2 < d-^/logn, 

(ii) T.ij&E{G) Wti-'tjh > 

(iii) for each distinct i,j G [n], for all bnt at most en integers /c G [n], we have 1— > 
2(ll4ll^+l|L||^)’ 

(iv) for each distinct i,j G [n], is max{i |T+t■ ||,^j - well-distributed with respect to {ti,tj). 

For a fixed f G [n], since ||fi ||2 follows a distribution with 3 degrees of freedom, standard estimates 
on Chi-squared random variables, such as Lemma 1 in m , give 

Fdliilli > 3 -h 2V3t + 2t^) < e"*". 

Let t = -^Tlogn. If n is sufficiently large, then 2t^ -|- 2\/3f -|- 3 < 16 log n. As a result, P(||ti||| > 

16logn) < e-7iogn_ fjg 22 ce Property (i) holds with probability 1 — by taking the union bound 
over all i G [n]. Property (ii) holds with probability 1 — by Lemma [T5j For a fixed pair 

i,j G [n]. Property (iii) holds with probability 1 — by Lemma fTHl Hence by taking the 

union bound, we see that Property (iii) holds with probability 1 — For a fixed pair 

i,j G [n], by Lemma [T71 and the fact that each pair is contained in at least ^np‘^ triangles, we have 
Property (iv) for the pair i,j with probability 1 — Hence by taking the union bound, 

we see that Property (iv) holds with probability 1 — ^. Thus we see that all four events 

(i)-(iv) simultaneously hold with at least probability 1 — n~^ for sufficiently large n, provided that 
np^ > C logn for sufficiently large C- 

We now show that Properties (i)-(iv) imply Properties 1-3. Note that Properties 1 and 3 
immediately follow from Properties (i), (iii), and (iv). Further, since |£'(G)| < n^p, Property (ii) 
implies 

" = JeM S 

' ^ ijeE(G) ^ 
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Hence by Property (i), we have for all i,j E [n] 


ti - tjh < Wkh + Wtjh < Si/logn < 64|Uv^logre. 


□ 


3.4 Proof of Theorem [2] 

We can now prove the three-dimensional recovery theorem, which we state here again for conve¬ 
nience: 


Theorem 2. There exists tiq E N and c E M such that the following holds for all n > uq. Let 
G{{n],E) be drawn from G{n,p) for some p = n). Take t^^\...tn^ E where 

tf'^ ~ A^(0, laxs) are i.i.d., independent from G. There exists 7 = H(p®/log^n) and an event of 
probability at least 1 — ^ on whieh the following holds: 


For arbitrary subgraphs satisfying max* deg^(i) < yn and arbitrary pairwise direction corruptions 
Vij E for ij E Eh, the convex program ([3|) has a unique minimizer equal to 

for some positive a and for = 4 tl^\ 




i&[n] 


Proof. Let ng be a sufficiently large natural number larger than that coming from Lemma [TSl 
Lemma \V2\ implies G is p-typical with probability 1 — Condition on G being p-typical. 

Let c be the constant from Lemma [THl By applying Lemma [THl with e = ^Yg-^===, with probability 

at least 1 — n“®, we have 


1 . 

2 . 

3. 


For each distinct i, j E [n] satisfying i < j, for all but at most 2en = 


p 


we have 1 - (p^ 


1 ii 


> 


236 log2 


and 1 - 


ti tj 


2l471ogn 


integers k G [n], 


236 log2 


for all distinct i,j E [n], we have ||L — tj\\ < Ody^log n ■ p, where p = Ylij£E{G) II’ 

and 

the set {L}ie[n] is „ -well-distributed along G. 


Thus the probability that G is p-typical and Properties 1-3 listed above holds is at least 1 — n 
Hence we may apply Theorem H with with cq = 64Vlog n, ei = - T^’ = 

218 fog and Cl = yf=- The theorem holds if 


9 c; 9 

^ C^P < ^ 4 

“ 2^3 log^ n ~ 2^® log n log 

Letting 7 from the theorem statement be eg 
when p = log^^® n). 


1 /3cfp^ 

32 •3-64-1024-642 “ 32 • 3 • 64 • 1024 eg' 

note that the condition max, degft(z) < 771 is nontrivial 

□ 


4 Numerical simulations 

In this section, we use numerical simulation to verify that ShapeFit recovers locations in in the 
presence of corrupted pairwise direction measurements. Further, we empirically demonstrate that 
ShapeFit is robust to noise in the uncorrupted measurements. 

Let the graph of observations be an Erdos-Renyi graph G{n,p) for p = 1/2. Let E be 
independent AA(0, Isxs) random variables for z = 1... n. Let ^ For ij E E{G), 
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ShapeFit under corruptions and no noise 



0 0.1 0.2 0.3 0.4 0.5 


Corruption probability (q) 


ShapeFit under corruptions and noise 
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Figure 1: Average recovery error of ShapeFit as a function of the number of locations n and the 
corruption probability q. The data model has n 3d Gaussian locations whose pairwise directions are 
observed in accordance with an Erdos-Renyi graph G(n, 1/2) and are corrupted with probability q. 
White blocks represent an average recovery error of zero over 10 independently generated problems. 
Black blocks represent an average recovery error of 100%. The left panel corresponds to the noiseless 
case u = 0, and the right panel corresponds to the noisy case a = 0.05. 


let 


Vij = 




+ crzi 


with probability q 
with probability 1 — q 


where Zij are independent and uniform over §^. Let Vij = %/||% || 2 - That is, each observation is 
corrupted with probability q, and each corruption is in a random direction. In the noiseless case, 
with fj = 0, each observation is exact with probability 1 — 

We solved ShapeFit using the SDPT3 solver [23l|26] and YALMIP [16]. For output T = {ti}ig[n]) 

define its relative error with respect to as 


T r(o) 

where ||T||f is the Frobenius norm of the matrix whose column are {L}. This error metric amounts 
to an ^2 norm after rescaling. 

Figure [T] shows the average residual of the output of ShapeFit over 10 independent trials for 
locations in generated by p = 1/2, cr G {0,0.05}, and a range of values 10 < n < 80 and 
0 < ^ < 0.5. White blocks represent zero average residual, and black blocks represent an average 
residual of 1 or higher. Average residuals between 0 and 1 are represented by the appropriate 
shade of gray. The figure shows that ShapeFit successfully recovers 3d locations in the presence of 
a surprisingly large probability of corruption, provided n is big enough. For example, if n > 50, 
recovery succeeds even when around 25% of all measurements are randomly corrupted. Further, 
successful recovery occurs both in the noiseless case, and in the noisy case with a = 0.05. 

Figure [2] shows the average residual over 10 independent trials for locations in generated 
by p = 1/2, n = 40, q = 0.2 and a range of values of 10" ® < (T < 10*^. We see that ShapeFit 
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ShapeFit for n = 50, q = 0.2 



Figure 2: Average recovery error of ShapeFit versus the noise parameter a. These simulations 
are based on n = 50 Gaussian locations in whose pairwise directions are observed in accordance 
with an Erdos-Renyi graph G{n, 1/2) and are corrupted with probability q = 0.2. The average is 
based on 10 independently generated problems. 


is empirically stable to noise, with average residuals that are approximately linear in the noise 
parameter a. 
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